NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

ML4SODA: A Decision Tree Guided Design Space Exploration for Fast and High Quality MLIR-based HLS

https://doi.org/10.1145/3716368.3735223

Manjunath, Darshith; Agostini, Nicolas Bohm; Tumeo, Antonino; Zhang, Jeff; Chakrabarti, Chaitali (June 2025, ACM)

Full Text Available
FP-SMR: A Fully Digital Floating-Point Processing-in-SAS-MRAM for Session-based Recommender System

https://doi.org/10.1145/3716368.3735206

Ali, Asmer Hamid; Sridharan, Amitesh; Guo, Cheng; Hwang, William; Tsai, Wilman; Zhang, Jeff; Chen, Yiran; X_Wang, Shan; Fan, Deliang (June 2025, ACM)

With the rapid advancement of DNNs, numerous Process-in-Memory (PIM) architectures based on various memory technologies (Non-Volatile (NVM)/Volatile Memory) have been developed to accelerate AI workloads. Magnetic Random Access Memory (MRAM) is highly promising among NVMs due to its zero standby leakage, fast write/read speeds, CMOS compatibility, and high memory density. However, existing MRAM technologies such as spin-transfer torque MRAM (STT-MRAM) and spin-orbit torque MRAM (SOT-MRAM), have inherent limitations. STT-MRAM faces high write current requirements, while SOT-MRAM introduces significant area overhead due to additional access transistors. The new STT-assisted-SOT (SAS) MRAM provides an area-efficient alternative by sharing one write access transistor for multiple magnetic tunnel junctions (MTJs). This work presents the first fully digital processing-in-SAS-MRAM system to enable 8-bit floating-point (FP8) neural network inference with an application in on-device session-based recommender system. A SAS-MRAM device prototype is fabricated with 4 MTJs sharing the same SOT metal line. The proposed SAS-MRAM-based PIM macro is designed in TSMC 28nm technology. It achieves 15.31 TOPS/W energy efficiency and 269 GOPS performance for FP8 operations at 700 MHz. Compared to state-of-the-art recommender systems for the same popular YooChoose dataset, it demonstrates a 86 ×, 1.8 ×, and 1.12 × higher energy efficiency than that of GPU, SRAM-PIM, and ReRAM-PIM, respectively.
more » « less
Full Text Available
CLAIRE: Composable Chiplet Libraries for AI Inference

https://doi.org/10.23919/DATE64628.2025.10992960

Nalla, Pragnya Sudershan; Haque, Emad; Liu, Yaotian; Sapatnekar, Sachin S; Zhang, Jeff; Chakrabarti, Chaitali; Cao, Yu (March 2025, IEEE)

Full Text Available
A 16nm Heterogeneous Accelerator for Energy-Efficient Sparse and Dense AI Computing

https://doi.org/10.1145/3665314.3670824

Raveendran_Nair, Gopikrishnan; Jiang, Fengyang; Zhang, Jeff; Cao, Yu (August 2024, ACM)

Full Text Available
SAFER: Sparsity Integrated Compute-in-Memory AI Accelerator with a Fused Dot-Product Engine and a RISC-V CPU

https://doi.org/10.1109/ESSERC66193.2025.11214049

Sridharan, Amitesh; Ali, Asmer Hamid; Lee, Yongjae; Anupreetham, Anupreetham; Liu, Yaotian; Zhang, Jeff; Seo, Jae-sun; Fan, Deliang (September 2025, IEEE)

Full Text Available
HISIM: Analytical Performance Modeling and Design Space Exploration of 2.5D/3D Integration for AI Computing

https://doi.org/10.1109/TCAD.2025.3531348

Wang, Zhenyu; Nalla, Pragnya Sudershan; Sun, Jingbo; Goksoy, A Alper; Mandal, Sumit K; Seo, Jae-sun; Chhabria, Vidya A; Zhang, Jeff; Chakrabarti, Chaitali; Ogras, Umit Y; et al (January 2025, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems)

Full Text Available
Intelligent Networking for Energy Harvesting Powered IoT Systems

https://doi.org/10.1145/3638765

Zhang, Wen; Pan, Chen; Liu, Tao; Zhang, Jeff Jun; Sookhak, Mehdi; Xie, Mimi (March 2024, ACM Transactions on Sensor Networks)

As the next-generation battery substitute for IoT system, energy harvesting (EH) technology revolutionizes the IoT industry with environmental friendliness, ubiquitous accessibility, and sustainability, which enables various self-sustaining IoT applications. However, due to the weak and intermittent nature of EH power, the performance of EH-powered IoT systems as well as its collaborative routing mechanism can severely deteriorate, rendering unpleasant data package loss during each power failure. Such a phenomenon makes conventional routing policies and energy allocation strategies impractical. Given the complexity of the problem, reinforcement learning (RL) appears to be one of the most promising and applicable methods to address this challenge. Nevertheless, although the energy allocation and routing policy are jointly optimized by the RL method, due to the energy restriction of EH devices, the inappropriate configuration of multi-hop network topology severely degrades the data collection performance. Therefore, this article first conducts a thorough mathematical discussion and develops the topology design and validation algorithm under energy harvesting scenarios. Then, this article developsDeepIoTRouting, a distributed and scalable deep reinforcement learning (DRL)-based approach, to address the routing and energy allocation jointly for the energy harvesting powered distributed IoT system. The experimental results show that with topology optimization,DeepIoTRoutingachieves at least 38.71% improvement on the amount of data delivery to sink in a 20-device IoT network, which significantly outperforms state-of-the-art methods.
more » « less
Full Text Available
Invited: EDA for Heterogeneous Integration

https://doi.org/10.1109/DAC63849.2025.11132516

Haque, Emad; Nalla, Pragnya; Sudarshan, Chetan Choppali; Yogi, Divya; Zhang, Hangyu; Chakrabarti, Chaitali; Chhabria, Vidya A; Harjani, Ramesh; Zhang, Jeff; Sapatnekar, Sachin S (June 2025, IEEE)

Full Text Available
Path Planning Under Uncertainty to Localize mmWave Sources

https://doi.org/10.1109/ICRA48891.2023.10160524

Pfeiffer, Kai; Jia, Yuze; Yin, Mingsheng; Veldanda, Akshaj Kumar; Hu, Yaqi; Trivedi, Amee; Zhang, Jeff; Garg, Siddharth; Erkip, Elza; Rangan, Sundeep; et al (May 2023, IEEE)

Full Text Available
Energy-Efficient Brain-Inspired Hyperdimensional Computing Using Voltage Scaling

https://doi.org/10.23919/DATE54114.2022.9774697

Zhang, Sizhe; Wang, Ruixuan; Ma, Dongning; Zhang, Jeff Jun; Yin, Xunzhao; Jiao, Xun (March 2022, 2022 Design, Automation & Test in Europe Conference & Exhibition (DATE))

Full Text Available

« Prev Next »

Search for: All records